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Abstract: 

We study the approximation of univariate and multivariate set-valued functions (SVFs) by the 
adaptation to SVFs of positive sample-based approximation operators for real- valued functions. To this 
end, we introduce a new weighted average of several sets and study its properties. The approximation 
results are obtained in the space of Lebesgue measurable sets with the symmetric difference metric. 

In particular, we apply the new average of sets to adapt to SVFs the classical Bernstein approxi- 
mation operators, and show that these operators approximate continuous SVFs. The rate of approx- 
imation of Holder continuous SVFs by the adapted Bernstein operators is studied and shown to be 
asymptotically equal to the one for real-valued functions. Finally, the results obtained in the metric 
space of sets are generalized to metric spaces endowed with an average satisfying certain properties. 

1. Introduction 

Set-valued functions (SVFs) have various applications in optimization, control theory, mathematical eco- 
nomics and other areas. The approximation of SVFs from a finite number of samples has been the subject 
of several recent research works ([3], [12], [13], [19]) and reviews ([11], [24]). 

In order to adapt to SVFs sample-based approximation methods known for real-valued functions, it is 
required to define linear combinations of two or more sets. For most approximation methods it is sufficient 
to consider linear combinations with weights summing up to one, while for positive approximation operators 
only convex combinations (non-negative weights summing up to one) are considered. We term convex linear 
combinations as weighted averages. 

In case of data sampled from a SVF mapping real-numbers to convex sets, methods based on the classical 
Minkowski sum of sets can be used for the approximation [8, 27]. In this approach, sums of numbers in 
positive operators for real- valued approximation are replaced by Minkowski sums of sets. A generalization to 
sets which are either convex or differences of convex sets is done in [2] , where convex sets are embedded into 
the Banach space of directed sets. This approach allows to apply existing methods for the approximation in 
Banach spaces [3] . 

Approximation of set-valued functions mapping real-numbers to general sets is a more challenging task. 
In this case, methods based on Minkowski sum of sets fail to approximate the sampled function [27, 10], and 
other weighted averages of sets are needed. 

Artstein [1] introduced a weighted average of two sets with the property that the Hausdorff metric between 
the average and any of the averaged sets changes linearly with the weight of the average. This average was 
later termed as the metric average of sets. An extension of the metric average to the weighted average of 
several sets, named the metric linear combination, is given in[12]. The metric linear combinations were used 
in [12] to adapt to sets positive and non-positive approximation operators, with the approximation error 
measured in the Hausdorff metric. However, the metric linear combination is applicable only to ordered 
sequences of sets, which limits its usage to the approximation of univariate SVFs. 

As it is noticed in [1], the particular choice of a metric is crucial to the construction and analysis of 
set-valued approximation methods. While previous works develop and analyze set-valued approximation 
methods in the metric space of compact sets endowed with the Hausdorff metric, we consider here the 
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approximation problem in the metric space of Lebesgue measurable sets with the symmetric difference 
metric 1 . The symmetric difference metric allows to obtain approximation results for a wider class of functions, 
as is demonstrated in [19], where set- valued subdivision techniques are investigated. 

In this work, we consider the adaptation of positive sample-based approximation operators to univariate 
and multivariate SVFs . The adaptation is based on a new weighted average of several sets, termed the 
partition average, which is studied in details. 

As it is very well known, the concept of the weighted average of numbers is closely related to that of the 
mathematical expectation of a discrete random variable. Similarly, a weighted average of several sets may be 
interpreted as the expectation of a random set [23]. We use tools from the theory of random sets to prove 
properties of the partition average of sets. 

First, we adapt to SVFs the classical Bernstein operators, and show that these operators approximate 
continuous SVFs. Furthermore, we consider the rate of approximation of Holder continuous SVFs by set- 
valued Bernstein operators, and obtain a result for SVFs analogous to that of Kac [17, 18] for real-valued 
functions. Moreover, we show that the adaptation to SVFs of the classical de Casteljau's algorithm (see, e.g. 
[14], Chapter 4) yields another sequence of adapted operators having the same rate of approximation as that 
for the adapted Bernstein operators. 

The results for Bernstein operators are then extended to general positive sample-based operators. More- 
over, we study the application of positive sample-based operators to monotone SVFs, and show that the 
adapted operator is monotonicity preserving if and only if the corresponding operator for real-valued func- 
tions is monotonicity preserving. 

Due to the commutativity of the partition average of sets, the results are easily generalized to approxi- 
mation operators for multivariate SVFs. Finally, we generalize the approximation results to functions with 
values in metric spaces endowed with a weighted average, with properties similar to those of the partition 
average of sets. 

The structure of this work is as follows. In Section 2, we survey definitions and results relevant to our 
work. In Section 3, we introduce the partition average of sets and study its properties. In Section 4, we 
adapt to sets the Bernstein approximation operators. In section 5, we study another type of set-valued 
Bernstein operators, obtained by adapting to sets of the de Casteljau's algorithm . In Section 6, we consider 
the adaptation to SVFs of positive sample-based operators. In Section 7, we discuss the approximation of 
monotone SVFs. The approximation of multi-variate SVFs is the subject of Section 8. Finally in Section 9, 
we generalize the results to functions with values in general metric spaces. 

2. Preliminaries 

2.1. Sets and the symmetric difference metric 

We denote by /i the m-dimensional Lebesgue measure and by £ the collection of Lebesgue measurable subsets 
of W n having finite measure. The set difference of two sets A, B is 

A\B = {p:peA,p^B} , 

and the symmetric difference is defined by 

AAB = A\B\jB\A . 

The measure of the symmetric difference of A, B 6 £, 

(A, B) = n (AAB) , 

induces a pseudo-metric on £, and (£, d^) is a complete metric space by regarding any two sets A, B such 
that \i (AAB) = as equal ([16], Chapter 8). For A,B £ £,, such that B C A, it is easy to observe that 

d li {A,B) = l x{A\B)= l x(A)- l x(B) . (2.1) 

1 The measure of the symmetric difference is only a pseudo-metric on Lebesgue measurable sets. The metric space is obtained 
in a standard way as described in Section 2 
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We use the notation ci (A) for the closure of the interior of A. A bounded set A, such that A = - ci (A) is 
called regular compact. Regular compact sets are closed under finite unions, but not under finite intersections, 
yet for A, B regular compact sets such that B C A, 

Af]B = B = ci^Af]B^j . (2.2) 

We recall that a set A £ £ is Jordan measurable if and only if its boundary has zero Lebesgue measure. 
Jordan measurable sets are denoted by J. We recall that J is closed under finite unions and finite intersections. 
Note that for A e J, 

li(A) = n(d(A)) . (2.3) 

Moreover for Bq, B n G J, 

(n \ n 

\jBt =Q (2.4) 

We denote by Z the subset of I consisting of regular compact sets. Notice that for any A, B <G 3, (A, B) = 
implies A = B, therefore d M is a metric on -J. In particular, the empty set <f> is in -J, and it is the only set in 
-J having zero measure. Note that by its definition -3 is closed under finite unions. 



2.2. Real-valued Bernstein approximation 

For a function / : [0, 1] — > R, the Bernstein polynomial of degree n is 

B n (f, X )=J2^ n i y i (l-x) n - i f (£) . (2.5) 

The mapping /(•)—> S„ (/, •) is called the Bernstein operator. An extensive exposition of Bernstein poly- 
nomials is given in [7]. 

Obviously one can interpret (2.5) as the weighted arithmetic average of the values / (^). The probabilistic 
nature of the Bernstein polynomials is also well known. It can be recognized by interpreting the weights, 

b(n,x;i) = ( " ) x l (l-x) n ~ l , (2.6) 

as point probabilities of a binomial distribution with parameters n and x. 

The polynomials B n (/, •) are the basis of Bernstein's proof of the Weierstrass Approximation Theorem 
([4], see [20] for a modern presentation). Using Bernstein polynomials the theorem can be formulated as 

Theorem 2.1. Let f : [0, 1] — > R be a continuous function, then for any e > there exists N > 0, such that 
for all n > N and all x € [0, 1] , 

|/(aO-B n (/,a;)|<e ■ 

A stronger version of the above theorem for Holder continuous functions is due to Mark Kac ([17, 18], 
see [21] for a modern presentation). We denote by Lip (L, v) the class of Holder continuous functions with 
exponent v and constant L, defined on [0, 1], namely functions satisfying, 

\f(x)-f(y)\<L\x-y\ u ,x,y e [0,1] . (2.7) 

Theorem 2.2. Let f £ Lip (L, v), then 

\f(x)-B n (f,x)\<L(^^iy 2 . 

Our adaptation of Bernstein operators to SVFs is based on the new average of sets introduced in Section 
3. To obtain the relevant properties of the new average of sets, we give it a probabilistic interpretation 
using the notion of a random closed set, which is discussed together with basic relevant results in the next 
subsection. 



2.3. Random sets 



We proceed with a few definitions regarding random sets. The following definitions and results are adapted 
from [23] , which provides a thorough account of random sets theory. 
Here we denote by JF the collection of closed subsets of W l . 

Definition 2.3. Let {SI, 5, Pr} be a probability space. A map X : O — > J- is called a random closed set, if 
for every compact set K C W n , 

€Q;X(u})f)K ^(fil . (2.8) 
In the sequel we assume that X is discretely distributed, namely, X(uj) £ {Aq, A n }, with Pr {X = Ai} = 

n 

ai > and a, = 1. Moreover, we assume that Ai £ 3, i = 0, ...n. Note that for any / : T — s- R and any 

t=o 

random set X, f (X) defines a real-valued random variable. 
Random closed sets X^, X n are said to be independent if, 

PT{X 1 £X 1 ,...,X n £X n } = Pv{X 1 £ X 1 }---Pr{X n £X n } , (2.9) 

for all X\ , . . . , X n £ 03 ( F) . Here 03 (J 7 ) is generated by all collections of closed sets of the form {F £ T : F f] K ^ <p} 
with K running through all compact subsets of W n ([23], Section 1.2). 

The coverage function px (•) : K m — > [0, 1] of the closed random set X is ([23], Section 2.2) 

px{u)=Vx{u£X] . (2.10) 

Notice that for discretely distributed random set X, 

px (u) = a - ■ ( 2 - U ) 

{i:u£Ai} 

The following relation is useful, 

J Px (u)du = E(fi{X)) , (2.12) 
where E denotes the expectation of a real-valued random variable. Clearly, for a discretely distributed X 

n 

accepting values {Ao, A n } the integral in (2.12) can be taken over (J Ai. 

i=0 

Let u £ l m , setting Xi = {F £ T : Ff] {u} ^ </>}, i = 0, ...,n one obtains from (2.9) that for independent 

Xl, ...Xn, 

Pr{u£X 1 ,...,u£X n }=p Xl (u)---Px n (u) . (2.13) 

In the next section we define a new average of sets, with which we adapt the Bernstein operators to SVFs 
and obtain results analogous to Theorems 2.1 and 2.2. 

3. The partition average of sets 

The construction of our average of sets is built upon several definitions. We begin with 
Definition 3.1. Let * : J x [0, 1] — > 3 be such that 

1. $(A,t) C A 

2. n(V(A,t))=tn(A) 

3. For s < t, * (A, s) C * (A, t) . 

The function ^ is called the subset-generating function. 

Note that since A £ J and * (A, t) £ 3, 

*(A,0) = <i>, *(A,l) = d(A) . (3.1) 

For any collection of sets in -3, we consider a special partition of their union to mutually disjoint sets, 
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Fig 1. The partition of the union of three subsets of R , the triangle, 
gray-tone belong to the same element of the partition of the union. 



the rectangle and the ellipse. Regions with similar 



Definition 3.2. Let {Aq, A n } C Z- For any subset \ of the indices {0, n}, we define the set 

^■■■^ = (n^)\( u A ■ ( 3 - 2 ) 

\ke X } \!£{0,..,n}\ X / 

For a fixed collection of sets {Aq, ...,A n } we use the shorthand notation fl x . The collection of sets 

{fi£>--^: X €2< n >} , 

where 2^ denotes all subsets of the set of integers {0, n} : is termed the partition of the union of Aq, A n . 
The sets Q x are termed elements of the partition. 

An example of the partition of the union of three subsets on R 2 is given in Figure 1. 

Here we state several properties of the partition of the union, that follow easily from Definition 3.2. 

Lemma 3.3. Let {A , A n } c 3 then 

1. n x el 

s. u n x = Aj . 

{xe2{™>:je x } 

I For a fixed X € 2^, x^<f>, [J fi x = f| A,-. 

{ x e2{"}:xc x } iex 

n 

5. U ^ x = U A 

Next observation connects the notion of the partition of union with random sets. The proof of this 
observation follows from Definition 3.2 and (2.11). 

Lemma 3.4. Let X be a random set, X (oj) £ {Aq, A n }, Pr {X = A{) = on. The coverage function, px (u) 
is constant over each element Q x of the partition of the union of Aq, ...,A n , and 

Px («)|n x =J2 a * ■ 

We are now in a position to define a new weighted average of sets in 3, which is based on the partition of 
the union of the averaged sets. 
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Definition 3.5. Let Aq, ...,A n € Z and ao,...,a n S [0,1], ^ = 1. The partition average of Aq, A Tl 



with the weights ai, ...,a n is 



i=0 



J^a^- (J $ fi X) ^aJ , (3.3) 

where ^ is a sub set- generating junction in Definition 3.1. 

Using the partition average, we can define expectation of a discretely distributed random set as 

Definition 3.6. Let X be a random set, X (ui) € {Aq, A n }, Pr (X — Ai) — a>i,i — 0, ...n. The partition 
expectation of X is 

E P (X) := <g) a,A, . (3.4) 

i=0 

Remark 3.7. In view of Lemma 3.4, the partition expectation is related to the coverage function through 

E P (X)= |J *(n x ,p x \ nx ) . (3.5) 

Next we state relevant properties of the partition average of sets. 
Theorem 3.8. In the notation of Definition 3.5, 

n 

1. ® aiAi e 3 

i=0 

n n 

2. For any permutation r(-) of{0,...,n}, <g> aiAi — - Cg) a r a\A r u\ 

i—0 i—Q 

n k 

3. If for some k € {1, n}, A^ = Ak + \ = ... = A n , then ® aiAi = ® P%Ai with /3j = ctj, i = 0, k — 1 

and (3k — - ^ ai. In particular, (g) a^A = A 
i=k i=0 

(n \ n n 

f| a t A t C ® C [J Ai 
{i:ai>0} J l -° {i:Ui>0} 

n 

5. If for some j , aj = 1, then ® aiAi = Aj 

i=0 



6. fJ, ® Q!j^4, = J] CKijU^) 
V=0 / j=0 

Proof. To obtain Property 1, observe that by Definition 3.1, ^ (fl x ,t) € J for any x € 2^ n \ t e [0,1], 
and recall that 3 is closed under finite unions. Properties 2,3, follow immediately from the definition of the 
partition average. 

Next we prove Property 4. Let x ™ {j € {0, ...,n} : ctj > 0}. Since ^ ay = 1, aj — 1 for x 2 X- 

jex j£x 

Therefore, from (3.1), (2.4) and Property 4 in Lemma 3.3, 

U *( fi x>E a *) = U ci(OJ=ci |J Q x j=cijf|A i 

{X:XCX> \ *6X / { X :xCx} \{x:xC X } / V^X 

/ \ n 

and thus ci f] Ai C ® o^^. The other part of Property 4, follows from the observation that © aiA; C 
U !!^U A,. 

{x:xHx#0} J'GX 

Property 5 is an immediate consequence of Property 4. Next we prove Property 6. From the definition 
of the partition average, from the fact that the sets {f2 x : x S 2^} are pairwise disjoint and from the 
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properties of the subset-generating function, we obtain that 



(3.6) 



To proceed with the proof of Property 6, we interpret ® aiAi as the partition expectation of a random set 

i=0 

X, such that Pr (X — Aj) — a,, i — 0, n. Now by Lemma 3.4 and by Properties 2, 5 of Lemma 3.3, 



E Jpx(u)du= J p x (u)du. 



(3.7) 



XS2W 



U ^ 

i=0 



Finally, we apply (2.12) to obtain that 

H ( <| otiAi J =E(fi (X)) = olhi (Ai) . 



□ 



Remark 3.9. The above properties of the partition average are analogous to those of weighted averages 
between non-negative numbers. In this analogy, the measure of a set replaces the absolute value of a number, 
the measure of the symmetric difference of two sets (d^(-; •)) replaces the absolute value of the difference 
between two numbers. Moreover, the intersection and union of sets replace the minimum and the maximum 
of numbers, and finally the relation C between sets replaces the relation < between numbers. 

The next theorem treats the distance between the partition expectations of two independent random sets 
distributed over the same collection of sets {Ao, A n }. 

Theorem 3.10. Let X\,X 2 be independent random sets, Pr {Xi = A{\ = ai, Pr {X2 = Ai} = Pi, i = 0, n, 

n n 

with "Yli cti = ^2 Pi = !• Then 

j=0 i=0 

d fl (Ep(X 1 ),E P {X 2 ))<E(d^(X 1 ,X 2 )) , 
where d^ (Xi,X 2 ) is the real-valued random variable d^ (Xi,X 2 ) = fx (XiAX 2 ), namely 

Pr(d M (Xi,X 2 ) =n(A i AA j )) = aif3j, i,j = 0,...,n . 
Proof. It follows from the definition of the partition expectation, and by the fact that the partition elements 

n 

are disjoint sets with their union equal to (J Ai (see Lemma 3.3), that 

i=0 

d fi (E P (X 1 ),E P (X 2 ))= d /* (* ("x'S" 4 I fl x>EA 



By the properties of the subset-generating function, for any x G 2^™^ one of the two sets I Q x , a* 1 , \& I fl x , /?, 
is necessarily contained in the other, and we get from (2.1), 



d fl (Ep(X 1 ),E P (X 2 ))= M(^x)E^"E^ 
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Now from Lemma 3.4 and Properties 2,5 of Lemma 3.3 we get 



d ll {Ep{X 1 ),E P {X 2 ))= J \ PXl {u)~ PX2 {u)\du . (3.8) 

On the other hand, 

E (d M (X 1 ,X 2 )) =E(ji (X 1 AX 2 )) = E (p (X X \X 2 )) + E (p (X 2 \X 1 )) . 
Since X\,X 2 are independent, 

Px lX x 2 («) = («) (! - Px 2 («)) , u G K m , 

and we obtain from (2.12), 

£7 0*(Jfi\X 2 ))= / PXl (u) {1 - PX2 {u j) du . 

U 

<=o 

Using similar observations for i5 (/it (X2\Xi)), we arrive at 

E{d l >(X x ,X i ))= f {p Xl (u)(l-px 2 (u))+p X2 (u)(l- PXl (u))}du . (3.9) 



U a, 

i=0 

It is easy to obtain the claim of the theorem, by inspecting the relations (3.8) and (3.9), since 

\a - b\ < a (1 - b) + b (1 - a) , a, b £ [0, 1] . (3.10) 

□ 

From the above theorem we obtain, 
Corollary 3.11. Let X be a random set, Pr {X = Ai] = on, i = 0, ...,n. Then 

d^ (E P (X) , Aj) = E (^ {X, A,)) , (3.11) 

for any j £ {0, ...,n}. 

Proof. Consider the random set X, Pr |jf = = <5.y, i = 0, ..,n with <5y = 1 for i = j and 5^j = 
otherwise. By Property 5 of Theorem 3.8, Ep (x) = Aj, thus from Theorem 3.9, 



d^ {Ep (X) , Aj) < Ed^ (X, A,) . (3.12) 

Since p x (u) £ {0, 1}, if follows from (3.10), that there is an equality in (3.12). □ 
Approximation results in the next section are based upon the following corollary, which is derived from 
Corollary 3.11. 

n 

Corollary 3.12. Let Ao,.,.,A n £$ and ao,...,a n £ [0,1], on = 1, then 

i=0 

d„ [Aj, ® ctiAi ) =^«i^ (A/.Ai) . (3.13) 



i=0 



Namely, the distance to the partition average from any of the averaged sets is egual to the average of the 
distances from this set to all the averaged sets. 



Remark 3.13. Relation (3.13) is reminiscent of the relation, 
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<j2oi\r-pi,\- (3-14) 



r - 2J UiPi 

8=0 



where r,Pi, i = 0, ...,n are numbers. However, notice that there is equality in (3.13) versus inequality in 
(3. 14-). Moreover, observe that (3.13) applies only to the sets participating in the partition average, while 
(3.14) applies to any r£l. This limitation has implications to the approximation power of methods based 
on the partition average. 

The partition average of two sets possesses also the metric property [9] relative to d^ (•, •). 

Corollary 3.14. Let A ,A X e J, a ,f3 £ [0,1], then 

(a A ® (1 - a ) Ax, f3 A ®(l-l3o) Ax) = \a - p \ d^iAo, Ax) . (3.15) 

Proof. Let X\, X 2 be random sets such that Pr (X\ — Ai) = ai and Pr (A 2 = Ai) = fa, i = 0, 1. Note that 
ai =l-a ,(3x = l-[3 . By (3.8), 



d„(E P (Xx),E P (X 2 ))= / \p Xl {u)-p X2 {u)\du . 



Since for u <f_ AAB, px x (u) — px 2 ( M )i we get 

d„(aoAo®(l-a )Ax,f3oA ®(l-f3o)Ax)= / \a - f3 \du = \a - A)| (AAB) 



AAB 

□ 

To complete the construction of the partition average of sets, we need to provide a concrete example of a 
subset-generating function in Definition 3.1. We denote by Bl (p, r) a ball of radius r about p £ E m , namely 

Bl (p,r) = {q £ E m : ||«-p|| <r} , 
with || • || the Euclidean norm on K m . The subset-generating function $ : J x [0, 1] — > 3 is defined by 

4- (A,t) =01(5/(^,01^ A) , (3.16) 

where r&,t is chosen so that ^ (^ (A, t)) = tp, {A). The existence of r^,t as above for any t £ [0, 1] follows 
from the continuity of the volume of the ball as a function of its radius. An example of the partition average 
with a such defined subset-generating function ^ is shown in Figure 2. In this example, p is the centroid of 
the union of the averaged sets. 

Although there is a significant resemblance between the partition average of sets and the weighted average 
of numbers as is noticed in Remarks 3.9 and 3.13, the partition average of sets lacks several important 
properties of the weighted average of numbers. 



Remark 3.15. The partition average is generally not associative, 

2 

i=0 



* («o+ai)(^A ®^A 1 )® a2 A 2 
^ a A ®{ax+a 2 )(-^Ax®^A 2 ) 



Remark 3.16. Zero-weighted sets in (3.3) affect the partition average by affecting the partition of the union 
of all the sets, namely, 

n n+1 

(g) atAi ^ a l A l , 

i=0 i=0 

n 

with Y] 04 = 1 and a„+i = 0. Yet, the average "is between the intersection and the union of the sets with 

i=0 

positive weights" (see Property 4 in Theorem 3.8). 



Fig 2. The partition average (g> c*i Ai, with on = = , i = 0, 1, 2 and Ai, i = 0, 1, 2 the three sets in Figure 1. 



4. Set-valued Bernstein approximation based on the partition average 

Using the partition average of sets defined in the previous section, we can now define a set- valued operator 
analogous to (2.5), 

Definition 4.1. The the set-valued Bernstein operator is the mapping F (•) — > B n (F, •) given by 



for any F : [0, 1] — > 3, where b (n, x; i) are defined in (2.6). 

Using Definition 4.1, we aim to obtain approximation results analogous to Theorem 2.1 and Theorem 2.2. 
First we note that by Property 5 in Theorem 3.8 and by (2.6), B n (F,0) = F(0),B n (F,l) = F(l). The 
set- valued version of Theorem 2.1 is 

Theorem 4.2. Let F : [0, 1] — > J be a continuous SVF, then for any e > there exists N > 0, such that for 
all n > N and all x € [0, 1], 



The proof of Theorem 4.2 is based on the following two lemmas. 

Lemma 4.3. In the notation of Definition J^.l, let x £ [0, 1] and let x' be the point closest to x among -, 
i = 0, n. Then 




(4.1) 



d n (F (x) , B n (F, x)) < e . 



(4.2) 



n 



d^ (F (x) , B n (F, x)) < 2d u (F (:r/) , F (x)) + ^ 6 (n, x; i) d^ (F (x) , F (a*)) • 



(4.3) 



Proof. By the triangle inequality, 



d„ (F (x) , B n (F, x)) < d^ (F (x) , F (x')) + d„ (F (x') , B n (F, x)) 



(4.4) 



We obtain from Corollary 3.12 that 



n 



d„ (F {x') , B n (F, x)) = ^b (n, x- i) d„ (F {x') , F ( Xi )) , 



(4.5) 



i=0 
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and by the triangle inequality, 

n n 

b (n, x; i) (F (x') , F (a*)) < d M (F (x') , F (x j) + £ b (n, x; i) d„ (F (x) , F (a*)) . 

i=0 i=0 

This together with (4.4) and (4.5) completes the proof. □ 

Lemma 4.4. Let F : [0, 1] — > 3 be a continuous function, then for any e > there exists N > 0, such that 
for all n > N and all x € [0, 1] , 

n 

^b(n,x;i)d fi (F{x),F{x i )) <e . (4.6) 

Proof. For any x € [0, 1], consider the function g x (•) : [0, 1] — s- M, 

fe (y)=^(F( a ;) ) F(2 / )) . (4.7) 

It is easy to observe that due to the continuity of F, the family of functions {g x (•) : x € [0, 1]} is uniformly 
equicontinuous and uniformly bounded. 

n 

Next we note that ^ 6 (n, x; i) d^ (F (x) ,F{xi)) is B n (g Xl x). Since g x (x) = 0, we use Theorem 2.1 to 

i=o 

conclude that there exists N x > 0, such that (4.6) holds for all n > N x . By the uniform continuity and 
boundedness of {g x } xe ^ ^ there exists N < oo such that N x < N for all x € [0, 1]. □ 

Proof, of Theorem By Lemma 4.4, there exists a positive integer N\ s.t. for n > N%, 



J2b{n,x;i)d M {F(x),F(xi)) < \ , x G [0, 1] 

i=0 



By continuity of F, there exists a positive integer iV2 s.t. for \x — y\ < j^, i,y£ [0, 1], d M (F (x) , F (y)) < |. 
We set ./V = max {N\, N2} and apply Lemma 4.3 to obtain the claim of the theorem. □ 
Next we consider the rate of approximation of Holder continuous SVFs by the set-valued Bernstein 
operator. The class of Holder continuous SVFs, Lip (L, v) : is defined as in (2.7), using d^ (•, •) instead of the 
distance between numbers. 

Theorem 4.5. Let F e Lip (L, v), then for all n S N and all x £ [0, 1], 

d IM (F(x),B n (F,x))<L(^\ +l( X{1 ~ X) ^ 1 ■ (4.8) 

Proof. For any x £ [0, 1], let the function g x (•) be given by (4.7). F g Lip (L, v) implies that g x G Lip (L, v) 
for all x S [0, 1]. Since g x (x) = 0, we get from Theorem 2.2, 

n , /, _ \ \ u/2 
^b(n,x;i)d l ,(F(x),F( Xi )) = B n (g x ,x)<L( X ± ^ . (4.9) 

i=0 \ n J 

From F £ Lip {L, v), (4.9) and Lemma 4.3, we obtain the result of the theorem. □ 

x ^ ~ x ' ) , so the results 

obtained in Theorem 2.2 and Theorem 4.5 are asymptotically equivalent. 
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5. Set-valued Bernstein approximation with the de Casteljau's algorithm 

A widely used method for the evaluation of the real- valued Bernstein operators B n (/, x) is the de Casteljau's 
algorithm (see, e.g. [14], Chapter 4). The algorithm evaluates B n (/, x) through a sequence of averages of 
two numbers {binary averages) and is based on the following recurrence relation, 

b (n, x; i) = (1 — x) b (n — 1, x; i) + xb (n — 1, x;i — 1) , (5-1) 

where b(n,x;i) are given in (2.6). B n (f,x) in (2.5) can be represented using (5.1) as, 

n n—X 

B n (/, x) = b (n, x; i) f? = £ b (n - 1, x; i)^ 1 , (5.2) 

with 

IT = f (£) , i = 0, ...,n and /f" 1 = (1 - i) + , i = 0, 1, n - 1 . (5.3) 

The de Casteljau's algorithm repeats this recursion n times to get 

/''»;/..<•: ./;',. (5.4) 

In the real-valued case, (2.5) and the recursive relations (5.2)-(5.4) are equivalent, though the evaluation of 
Bn {f: x ) by the de Casteljau's algorithm is numerically stable. 

A straightforward adaptation to SVFs of the recursive relations (5.2)-(5.4) based on the partition average 

is 

if = (l-x)F? +1 ®xF£ff, i = 0,...k, fc = n-l,...,0 , (5.5) 

with F™ — F (a), and B n (F,x) is set to be F§. Note that by Remark 3.15, this adaptation yields a set- 
valued operator which is different from that in (4.1). The above construction is similar to that in [13] with 
the metric average. Similarly to [13], we do not expect that for a general SVF F, B n (F, x) converges to F (x) 
as n — > oo. 

We now alter the adaptation and apply Corollary 3.12, to obtain approximation results similar to Theorems 
4.2 and 4.5 also in the case of the de Casteljau's representation of the Bernstein operators. To this end we 
use a binary average, based on the partition determined by F (i), i = 0, n, of the form 

n+2 

\A® (1 - A) B = ® fcE, , (5.6) 

i=0 

where E t = F h i = 0, n, E n+1 = A, E n +2 = B and ft = 0, % = 0, n, /3 n+1 = A, /3 rl+2 = 1 - A. Then we 
apply the de Casteljau's algorithm with this average, namely 

Ft = (l-x)F t k+1 ®xFf+ 1 1 , i = 0,...k, k = n- 1,...,0, (5.7) 

and set, 

B° c (F,x)=F° . (5.8) 
Since F (— ), i = 0, n, is in the partition behind <§), we get from (5.7) and Corollary 3.12, that 

d A F o^(i)) - d ll ((l-x)F^xFl,F{i)) 

= (l~x)d,(Fi,F(£)+xd,(F 1 \F(^) . 

Continuing the recursion we finally obtain the real- valued de Casteljau's algorithm for the function g± defined 
in (4.7), namely with the initial data 

Therefore by (5.8) we have for i — 0, ...,n, 

^(bS c {F,x),f(^ ^M^ijl^^Q.f^jj^elO,!] • (5.9) 

The equality (5.9) is the same as (4.5), and therefore Theorems 4.2 and 4.5 also apply to the set-valued 
Bernstein operators defined by the de Casteljau's algorithm with the binary average (5.6). 
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6. Approximation of SVFs by positive sample-based operators 



We consider the adaptation to SVFs of families of positive sample-based operators for real-valued functions, 
denned for n € N as 

O n (/, x) = c„ :i (x) f (x nji ) , (6.1) 

In 

where / : [0, 1] K, x £ [0,1], = x n . < x n ,i < ... < x ndn = 1, c nS (x) > and £j c Wi j (ar) = 1. 

Moreover, we denote (5„ (x) = min \x — x n i\ and assume that for any x G [0, 1], lim <5„ (a;) = 0. The 

ie{o,...,z„} n->oo 

real-valued Bernstein approximation operators are a prominent example of a family of operators as above. 

Other examples are the piecewise linear interpolation operator and the Schoenberg spline operators [26] . 

In analogy to the adaptation of the Bernstein operators to SVFs, we define for F : [0, 1] — > 3, 

O n (F,x)= ® c n:l (x)F{x n .i) . (6.2) 

i=0 

The first result is obtained immediately due to Property 6 in Theorem 3.8 of the partition average, 
Corollary 6.1. Let O n and O n be as in (6.1) and (6.2) respectively. Then 

pi (O n (F, x)) = O n (n (F (x)) ,x) , X € [0, 1] . 

Next we extend the results obtained in Theorems 4.2 and 4.5 for the Bernstein operators. Using the 
method of proof in Theorem 4.2 we obtain 

Corollary 6.2. Let O n and O n be as in (6.1) and (6.2) respectively. Assume that for any continuous real- 
valued function f : [0, 1] — > R and any e > 0, there exists Nf tti such that for all n > Nf^ and all x G [0, 1], 



f(x)-O n (f,x) 



< e 



Then for any continuous SVF, F : [0, 1] — > 3,- and any e > 0, there exists Np,e such that for all n > Np_ e 
and all x € [0, 1], 

d fi (F(x),O n (F,x))<e. 

For Holder continuous SVFs, we obtain by arguments similar to those in the proof of Theorem 4.5, 

Corollary 6.3. Let O n and O n be as in (6.1) and (6.2) respectively. Define the approximation error of O n 
to functions in Lip (L, v) at x € [0, 1] as 

e n .L.» (x) = sup O n (/, x) - f (x) 
fehip(u,L) 

Then, 

sup d a (O n (F, x) , F (x)) < e n .L, v (x) + LS„ (x) v . 

FeLip(i/,L) 

Approximation of continuous functions by positive operators are discussed in ([15], Chapter VII, §1) in 
the context of probability theory, while approximation results for Holder continuous real-valued functions 
are the subject of [22]. 
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7. Approximation of monotone SVFs 

Next we obtain several results specific to the approximation of monotone SVFs by positive sample-based 
operators. We begin with a simple condition for the monotonicity preservation by positive sample-based 
operators for real-valued functions. 

n n 

Lemma 7.1. Let a.Q,...,ot n G [0:1]; Poi-~>Pn € [0,1], ^ a * = 12 Pi = 1j such that for any integer k, 

i=Q i=0 

< k < n, 

n n 
i—k i—k 

The condition (7.1) is necessary and sufficient for the inequality, 

n n 

® a % r, < ® , (7.2) 

i=0 i=0 

to hold for any monotone non- decreasing sequence of numbers tq, ...,r n (E IR. 

Proof. The necessity follows by considering sequences of the form r = r\ = ... = r& < rj. +1 = r^ + 2 ••• — r n- 
The sufficiency can be obtained by setting, 

3 

r.j = A + 22 A 2 , j = 0, n , 

i=i 

where Ao = ro, Aj = r< — r,_i > 0, i = 1, ...,n, and considering the contribution of each Aj in (7.2). □ 
In case ro, ■■■,r n is monotone non-increasing, condition (7.1) implies by symmetry, that 

n n 
i=0 i=0 

Corollary 7.2. Lei O n (F,x) be a positive sample-based operator defined as in (6.2), such that for any 
x,y € [0, l],x < y, the weights on — c 7h i(x), Pi = c n ^(y) satisfy (7.1). Then O n is monotonicity preserving. 

Next we show that similar conditions are necessary and sufficient for the monotonicity preservation by 
positive sample-based operators for SVFs. A sequence of sets {Fi} igZ is termed monotone non-decreasing 
(non-increasing), if for all i, Fi C Fi+i (F{ D -Fi+i)- Monotone non-decreasing (non-increasing) SVFs are 
defined in a similar way. 

Lemma 7.3. Let ao, a n , /3q, f3 n be as in Lemma 7.1, then condition (7.1) is necessary and sufficient 
for the relation, 

n n 

® a t A t C ® (SiAi , 

i=0 i=0 

to hold for any monotone non- decreasing sequence of sets Aq, A n G J. 

Proof. In view of Property 3 of Theorem 3.8, the necessity follows by considering sequences of the form 
Aq = Ai = ... = Ak C Ak+\ = Ak+2--- = A n . To obtain the sufficiency, assume that Aq, ...,A n is monotone 
non-decreasing and let Sl x be as in Definition 3.2. We observe that due to the monotonicity of the sequence 
Ao,...,A n , if O x 7^ then necessarily \ = {k,k + 1, ...,n} for some integer k, < k < n. Using (7.1) and 
Definition 3.1 we obtain that for £l x as above, 

[Q x ] a . !_ [O x ] j2 p i , 

which in view of Definition 3.5 of the partition average completes the proof of the lemma. □ 
In case Aq, ...,A n is monotone non-increasing, condition (7.1) implies by symmetry, that 

n _ n 

® otiAi D ® PiAi . 

2=0 i=Q 
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Corollary 7.4. Let O n (F,x) be a positive sample-based operator defined as in (6.2), such that for any 
x,y € [0, l],x < y, the weights ai — c n> i{x), pi = c n ^(y) satisfy (7.1). Then O n is monotonicity preserving. 

From Corollaries 7.2 and 7.4 we conclude 

Corollary 7.5. Let O n and O n be defined as in (6.1) and (6.2) respectively . Then O n is monotonicity 
preserving if and only if O n is monotonicity preserving. 

To obtain from (7.1) that the Bernstein approximation operators are monotonicity preserving, we need 
to show that for < X\ < x% < 1, < k < rij 

n n 

^2 b (n, x x ; i) < ^ b (n, x 2 ;i) . (7.3) 

i—k i—k 

This can be observed from the properties of the cumulative binomial distribution [28] , 

k 1 — x 

(», x; i) = (n - *) ( I ) f t n - k - 1 (1 - t) k dt , (7.4) 



i=0 







which is clearly monotone non-increasing in x and thus leads to (7.3). 

To continue the discussion, we recall the notion of the speed of a curve in a metric space (see e.g. [5], 
Chapter 2), which indicates the "smoothness" of a set- valued function. For a real- valued / the speed at a 
point x is 

( s v \f{x)~f{x + e)\ 

v ' {x) = £3 jij ' 

whenever the limit exists. For differentiable /, Vf is the absolute value of the derivative of /. The speed of 
a SVF F is defined as 

d^ (F(x),F(x + e)) 

vp (x) — lim — ;— : . 

e^O \e\ 

Using relation (2.1) and Corollary 6.1, we obtain 

Corollary 7.6. Let O n and O n be monotonicity preserving operators defined as above. Then for a monotone 
SVF F, the speed of O n (F, •) equals that of O n (fi (F) , ■). 

In particular, for the Bernstein set-valued operators applied to a monotone SVF F, the speed of B n (F, •) 
is a polynomial, since B n (/i (F) , •) is a monotone polynomial and therefore its speed is a polynomial too. 

8. Approximation of multivariate SVFs 

The results of Section 6 can be generalized to SVFs defined on a compact subset K of M. d . In this case we 
adapt to SVFs families of positive sample-based operators of the form, 

In 

On (f,p) = J2 c n4 (p) f ( Pn>i ) , P eK,neN, (8.1) 

i=0 

In 

where / : K — ¥ M, p n ,i G K, i — 0, l n , c n ^(p) > and °n,i (p) = 1- I n analogy with the univariate case, 

i=0 

we define 5 n (p) = min lip — p n A I, and assume that for any p € K, 
ie{o,...,J„} 

lim 6 n (p) = . (8.2) 

n— >oo 

Notice that (8.1) includes many well known families of approximation operators. Some examples are the 
approximation by tensor product Bernstein polynomials, tensor product Schoenberg splines operators ([6], 
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Chapter XVII) and multivariate Bernstein polynomials on simplices (see e.g. [22]). Similarly to the univariate 
case, we define for F : K — > 

O n (F,p) = ® c n ^(p)F(p n ^) . (8.3) 
i=0 

With definitions (8.1)-(8.3), the analogs of Corollaries 6.1, 6.2 and 6.3 for multivariate SVFs are easily 
derived. 

As an example of the application of (8.3), we consider the adaptation to SVFs of the piecewise linear 
interpolation over triangulations (see, e.g. [14], Chapter 3). We briefly recall that for a collection of points 
P = {pi, ...,pz} C M 2 , the triangulation T of P is a collection of triangles such that 

• The vertices of the triangles consist of points in P. 

• The interiors of any two triangles do not intersect. 

• If two triangles are not disjoint, then they share either a vertex or an edge. 

• No edge can be added between points in P without intersecting an edge of one of the triangles in T. 

In the notation of (8.1), let P = {po i--»Po.i } C M 2 , and let K be the convex hull of the points in P . 
Assume that the sequence {-Pn} ngN * s nes ted, Pq C Pi C Pi-... Let T n be a triangulation of P„, such that 

lim A„ = , 

n— > oo 

where A„ = max diam (T) and diam (T) is the diameter of the circumscribed circle of T. Note that for such 

a sequence of triangulations condition (8.2) is satisfied. 

Let the triangle A = PnfaiPnMiPnM De i n L„ and let p £ A. The piecewise linear interpolant L n (f,p) 
is defined as in (8.1) with the weights c n ^ (p) given by 



/ s _ area(p,p n , k2 ,p nyk3 ) , , _ area(p n , kl ,p,p n ,k 3 ) 

n,fei XPJ area(p ntkl ,Pn.,k 2 >Pn,k 3 ) ' n,k 2 \P) area{p„,h t ,Pn,k 2 ,Pn,h 3 ) ' 

I \ _ a'-ea(p„,fc 1 ,p„,fc 2 ,p) 
n,k 3 \P) area{p n<kl ,Pn,k 2 ,Pn,k 3 )'' 



(8.4) 



and 

c„, l (p) = 0, i<£ {ki,k 2 ,k 3 } . (8.5) 
It is easy to verify that for a continuous /, 

lim L n (f,p) = f(p) , (8.6) 

n— >oo 

and for / G Lip (L, z/), 

f(p)-L n (f,p)\ <LA» . (8.7) 

Similarly, for a SVF F, the piecewise interpolant L n (F,p) is defined as in (8.3), with the weights c n ^{p) given 
by (8.4)-(8.5). Using Corollaries 6.2 and 6.3 extended to multivariate SVFs, we obtain that for a continuous 
F 

lim L n (F,p)=F(p) , (8.8) 

n—>oo 

and for F G Lip (L, v), 

\F(p)-L n (F,p)\<2LA». (8.9) 

Note that in the real- valued case the zero- weighted summands in (8.1) do not affect the result, but this is not 

l„ 

so in the set-valued case. More precisely, let L n (F,p) = ® c n i(p)F (p n A. Then in view of Remark 

i=0, c„ i^O 

3.16, 

L n (F,p)^L n (F,p) . 

Moreover, while L n (F,p) is continuous by its definition, L n (F,p) is discontinuous in view of Remark 3.16. 
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9. Approximation of functions with values in general metric spaces 



Finally, we extend the approximation results for functions with values in the metric space {Zjd^} and the 
partition average to functions with values in general metric spaces endowed with an average satisfying certain 
properties. 

Let {X, dx} be a metric space, and let EH be an average on elements of X defined for non-negative weights. 
Assume that the average EH satisfies the conditions, that for any Ao,...,A„ G X and ao,...,a n G [0,1], 

n 

X) Oti = 1, 



n f n \ ^ — T 

33 a;Ai G X and d x [Aj, EH a t A, < ^ a % dx {Aj, A*) , j G {0, .., n} . (9.1) 



i=0 



i=0 

Let (9„ be defined by (8.1), we define for G : K — > X, 

n 

O n (F,x)= EH (at) G (x n ,i) ■ 

i—0 

With these definitions, it is straightforwardly to obtain approximation results similar to Corollaries 6.2 and 
6.3. 

To characterize metrics spaces, in which averages satisfying the relation (9.1) can be constructed, we 
observe that (9.1) is equivalent to the condition that 



ffl aiAi G f)Bl [ Ai^Oidx (Ai.Aj) ] , (9.2) 

3=0 



i=Q 



where £?/ (A, r) is the metric ball of radius r centered at A. 

Therefore, we say that a metric space is strongly convex, if for any Ao, A„ G X and olq, ■■■,a n G [0,1], 

n 

53 = 1, the set 

i=0 

n / n 

$(A ,...,A n ;ao, ...,(*„) = BI A.^a^x (A^A^) 

»=o \ j=o 

is not empty. Notice that for n — 1, the above definition coincides with the definition of a convex metric 
space in the sense of Menger (see, e.g., [25], Chapter 2). In a strongly convex metric space X one can define 

n 

the average of any Ao, A„ G X with the weights ao,...,a n G [0,1], a i — 1 as an y element in the set 

i=0 

$ (A , A„; ao, ...,a„) . 
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